

PROTEINS: Structure, Function, and Genetics 23:301-317 (1995) 


A Critical Assessment of Comparative Molecular 
Modeling of Tertiary Structures of Proteins* 

Steven Mosimann, Ron Meleshko, and Michael N.G. James 

Medical Research Council of Canada, Group in Protein Structure and Function, Department of Biochemistry, 
University of Alberta, Edmonton, Alberta T6G 2R7, Canada 


ABSTRACT In spite of the tremendous increase 
in the rate at which protein structures are 
being determined, there is still an enormous gap 
between the numbers of known DNA-derived 
sequences and the numbers of three-dimen- 
sional structures. In order to shed light on the 
biological functions of the molecules, research- 
ers often resort to comparative molecular mod- 
eling. Earlier work has shown that when the 
sequence alignment is in error, then the comparative 
model is guaranteed to be wrong. In 
addition, loops, the sites of insertions and deletions 
in families of homologous proteins, are exceedingly 
difficult to model. Thus, many of the 
current problems in comparative molecular 
modeling are minor versions of the global pro- 
tein folding problem. In order to assess objec- 
tively the current state of comparative mole- 
cular modeling, 13 groups submitted blind 
predictions of seven different proteins of undisclosed 
tertiary structure. This assessment 
shows that where sequence identity between the 
target and the template structure is high (> 70%), 
comparative molecular modeling is highly suc- 
cessful. On the other hand, automated modeling 
techniques and sophisticated energy minimiza- 
tion methods fail to improve upon the starting 
structures when the sequence identity is low 
(~30%). Based on these results it appears that 
insertions and deletions are still major problems. 
Successfully deducing the correct sequence 
alignment when the local similarity is 
low is still difficult. We suggest some minimal 
testing of submitted coordinates that should be 
required of authors before papers on comparative 
molecular modeling are accepted for publication 
in journals. © 1995 Wiley-Liss, Inc. 
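
As a rough illustration of the high and low sequence-identity regimes discussed in the abstract, the sketch below computes percent identity between a target and a template from an already-computed pairwise alignment. The function name `percent_identity`, the gap character, and the choice to count only columns in which both sequences have a residue are assumptions made for this example. The text does not state which convention was used, and other denominators (for instance, the length of the shorter sequence) give different values.

```python
def percent_identity(aligned_target: str, aligned_template: str, gap: str = "-") -> float:
    """Percent identity over the aligned, non-gap columns of a pairwise alignment.

    Both strings are rows of the same alignment, so they must have equal length.
    Columns with a gap in either row (insertions or deletions) are skipped;
    this denominator is an assumption, not a convention taken from the paper.
    """
    if len(aligned_target) != len(aligned_template):
        raise ValueError("aligned sequences must have equal length")

    compared = 0   # columns where both sequences have a residue
    identical = 0  # compared columns where the residues match

    for a, b in zip(aligned_target, aligned_template):
        if a == gap or b == gap:
            continue
        compared += 1
        if a.upper() == b.upper():
            identical += 1

    if compared == 0:
        return 0.0
    return 100.0 * identical / compared


if __name__ == "__main__":
    # Hypothetical toy alignment: one substitution and one single-residue gap.
    print(round(percent_identity("ACDE-FGH", "ACDQKFGH"), 1))
```

In the toy alignment, seven columns are compared and six match, so the value falls between the two regimes. Because the result depends on the alignment supplied, an alignment error changes the identity figure as well as the model built from it.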
INTRODUCTION 

Once a protein's sequence has been determined and 
it has been found to be a new member of a structur- 
ally characterized protein family, it is relatively 
straightforward to build a molecular model of the 
protein using a set of simple guidelines.1-7 Presently, 
there are several commercial and public domain computer 
programs that have been developed for modeling; 
these programs remove much of the tedium 
from the process. There are numerous reasons for 
constructing comparative molecular models of pro- 
teins. The molecular model may explain the struc- 
tural basis of existing experimental results and can 
provide one with structural information on which 
further experiments can be planned, executed, and 
evaluated. Site-specific mutations of the gene coding 
for the specific protein can provide important data 
regarding the protein's function. Perhaps some of 
the most revealing experiments are those designed to 
predict and to probe the molecular reasons for an 
enzyme's specificity.3 On a more practical note, the 
molecular model can sometimes be used successfully 
to determine phases for a crystal structure determination 
using the method of molecular replacement.4 
The more spectacular uses, however, are typified by 
the recent successful application of comparative mo- 
lecular modeling for identifying new classes of lead 
compounds in antimalarial drug development.5 

An example of the successful prediction of an enzyme's 
specificity from comparative molecular modeling 
is that for granzyme B (CCP1), a serine proteinase 
from cytotoxic T lymphocytes.3 A molecular 
model of CCP1 (48% identical to rat mast cell proteinase 
II) showed that an arginine at position 226 
would occupy the S1 specificity pocket, thereby suggesting 
a P1 specificity for an aspartate or glutamate 
residue. Subsequent synthesis and testing of a 
series of substrates differing in the nature of the P1 
residue confirmed the aspartate specificity of 
CCP1.8 The P1 specificity of CCP1 has recently been 
altered by site-specific mutagenesis of the residue at 


*This assessment does not indicate that any one particular 
modeling group or modeling technique is superior to any other. 
We do not believe that comparative molecular models can be 
ranked using a single or even several numeric indicators. As 
such, claims that particular modeling techniques are superior 
based upon the results herein are not justifiable, in our opin- 
ion. 